Aggregate-Query Processing in Data Warehousing Environments

Authors

  • Ashish Gupta
  • Venky Harinarayan
  • Dallan Quass
Abstract

In this paper we introduce generalized projections (GPs), an extension of duplicate-eliminating projections, that capture aggregations, group-bys, duplicate-eliminating projections (distinct), and duplicate-preserving projections in a common unified framework. Using GPs, we extend well-known and simple algorithms for SQL queries that use distinct projections to derive algorithms for queries using aggregations such as sum, max, min, count, and avg. We develop powerful query rewrite rules for aggregate queries that unify and extend rewrite rules previously known in the literature. We then illustrate the power of our approach by solving a very practical and important problem in data warehousing: how to answer an aggregate query on base tables using materialized aggregate views (summary tables).
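
The summary-table problem named at the end of the abstract can be illustrated with a small Python sketch. This is not the paper's GP-based rewrite algorithm, only a toy instance of the underlying idea: a query grouped by region is answered from a finer summary grouped by (region, product) that stores sum and count. The table contents and all names (`SALES`, `build_summary`, `stats_by_region_from_base`, `stats_by_region_from_summary`) are hypothetical and chosen purely for illustration.

```python
from collections import defaultdict

# Hypothetical base table of (region, product, amount) rows; not from the paper.
SALES = [
    ("east", "pen", 10),
    ("east", "pen", 20),
    ("east", "ink", 5),
    ("west", "pen", 7),
    ("west", "ink", 3),
    ("west", "ink", 9),
]


def build_summary(rows):
    """Materialize a summary table keyed by (region, product) -> (sum, count)."""
    summary = defaultdict(lambda: [0, 0])
    for region, product, amount in rows:
        cell = summary[(region, product)]
        cell[0] += amount
        cell[1] += 1
    return {key: (total, count) for key, (total, count) in summary.items()}


def stats_by_region_from_base(rows):
    """Compute (sum, count, avg) per region directly from the base rows."""
    acc = defaultdict(lambda: [0, 0])
    for region, _product, amount in rows:
        acc[region][0] += amount
        acc[region][1] += 1
    return {r: (s, c, s / c) for r, (s, c) in acc.items()}


def stats_by_region_from_summary(summary):
    """Compute the same answer by rolling up the coarser summary table."""
    acc = defaultdict(lambda: [0, 0])
    for (region, _product), (total, count) in summary.items():
        acc[region][0] += total   # sums of sums
        acc[region][1] += count   # sums of counts
    # avg is derived from the rolled-up sum and count, not from per-group averages.
    return {r: (s, c, s / c) for r, (s, c) in acc.items()}


if __name__ == "__main__":
    summary = build_summary(SALES)
    print(stats_by_region_from_summary(summary) == stats_by_region_from_base(SALES))
```

The sketch stores sum and count rather than avg in the summary because per-group averages alone cannot be combined into a correct coarser average, whereas sums and counts can simply be added.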

Similar resources

A data warehousing approach for building recommender systems

A data warehousing approach for recommender systems is proposed. We sketch an architecture for integrated OLAP and data mining in data warehousing environments, and argue why this architecture can be extended for building recommender systems. Since producing recommendations can be considered as conceptual query answering, the relationship between conceptual query answering and intensional answe...

The Cubetree Storage Organization

Relational On-Line Analytical Processing (ROLAP) is emerging as the dominant approach in data warehousing. To enhance query performance, the ROLAP approach relies on selecting appropriate subsets of aggregate views and materializing them in summary tables, which are then used to speed up OLAP queries. However, a straightforward relational storage implementation of materialized ROL...

Physical Data Warehouse Design on NoSQL Databases - OLAP Query Processing over HBase

Nowadays, data warehousing and online analytical processing (OLAP) are core technologies in business intelligence and have therefore drawn much interest from researchers in the last decade. However, these technologies have been mainly developed for relational database systems in centralized environments. In other words, these technologies have not been designed to be applied in scalable systems s...

Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing

Small Materialized Aggregates (SMAs for short) are considered a highly flexible and versatile alternative for materialized data cubes. The basic idea is to compute many aggregate values for small to medium-sized buckets of tuples. These aggregates are then used to speed up query processing. We present the general idea and present an application of SMAs to the TPC-D benchmark. We show that explo...

The Yin and Yang of Processing Data Warehousing Queries on GPU Devices

The database community has made significant research efforts to optimize query processing on GPUs in the past few years. However, GPUs have hardly been adopted in major production warehousing systems. In preparation for integrating GPUs into warehousing systems, we have identified and addressed several critical issues in a three-dimensional study of warehousing queries on GPUs by varyin...

Publication date: 1995